This Page Is Inserted by IFW Operations 
and is not a part of the Official Record 



BEST AVAILABLE IMAGES 

Defective images within this document are accurate representations of the 
original documents submitted by the applicant. 

Defects in the images may include (but are not limited to): 

• BLACK BORDERS 

• TEXT CUT OFF AT TOP, BOTTOM OR SIDES 

• FADED TEXT 

• ILLEGIBLE TEXT 

• SKEWED/SLANTED IMAGES 

• COLORED PHOTOS 

• BLACK OR VERY BLACK AND WHITE DARK PHOTOS 

• GRAY SCALE DOCUMENTS 

IMAGES ARE BEST AVAILABLE COPY. 



As rescanning documents will not correct images, 
please do not report the images to the 
Image Problems Mailbox. 



PA8E BLAH* 



(USPTO) 



(19) 



J 



Europalsches Patentamt 
European Patent Office 
Offic uropeen d sbr vts 




(11) 



EP 1 050 805 A1 



(12) 



EUROPEAN PATENT APPLICATION 



(43) Date of publication: 

08.11.2000 Bulletin 2000/45 

(21) Application number: 99410055.0 

(22) Date of filing: 03.05.1999 


(51) lnt-CI.7; G06F 9/38 

f" . * ... 

; • l<- . 


(84) Designated Contracting States: 


(72) Inventors: 


AT BE CH CY DE DK ES PI FR GB GR IE IT LI LU 


• Uguen, Laurent 


MC NL PT SE 


, 38000 Grenoble (FR) 


Designated Extension States: 


• Ferroussat, Sebastien 


AL LT LV MK RO SI 


38600 Fontaine (FR) 


(71) Applicant ST Microelectronics SA 


(74) Representative: Driver, Virginia Rozanne 


94250 Gentilly Cedex (FR) 


Page White & Farrer 




54 Doughty Street , 




London WC1N 2LS (GB) 



(54) Transfer of guard values in a computer /system 



(57) A method of transferring guard values and a 
computer system utilising the method are described. In 
a computer system using predicated instructor' exScu- 
tion, predicated on the values of particular guard ihbfcai * 
tors, a master set of guard indicators is held! The com- 
puter system has a parallel sot of execution units, and 
the master set is held in association with one of the ex- 
ecution units. If other execution units require the guard 



values for particular guard indicators, a sendguard in- 
struction is issued to the execution unit holding the mas- 
ter guard values. The sendgciard instructions are held 
> in a separate queueifrom the main instructions intended 
for that execution uriit. Moreover, circuitry is provided in 
the execution unit to avoid stalling in the despatch of 
sendguard instructions even in the context of earlier 
guard modifying instructions. 
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Des ription 

[0001] The present invention relates to a method of 
transferring guard values and to a computer system uti- 
lising the method, s 
[0002] Computer systems can operate in which in- 
structions include a guard indicator so that the execution 
or otherwise of the instruction is dependent upon the val- 
ue of the guard indicator. Problems do however arise 
when a plurality of instructions are executed simultane- 10 
ously in a plurality of parallel execution units. This may • 
arise in computer systems operating in superscalar or 
very long instruction word (VLIW) instruction modes^so 
that the execution of some instructions may affect the 
guard values to be associated with other instructions in 15 
parallel execution units. Problems may also arise from 
pipelined execution of instructions so that the execution , 
of some later instructions commences prior to the com- 
pletion of execution of earlier instructions in the pipeline. 
[0003] In a computer syslem which requires the trans- zo 
fer of guard values between parallel execution units, it 
is important that the. transfer of guard values is accom- . 
plished as promptly and efficiently . as possible. This 
however should be done without transferring incorrect • 
guard values and, preferably, without stalling tho opor- 2S 
ation of the pipelined execution units. 
[0004] Guarded instruction execution can be referred 
to as predicated instruction execution. 
[0005] According to one aspect of the present inven- 
tion there is provided a computer system for executing. 30 
instructions having assigned guard indicators, which 
system comprises instruction supply circuitry, a plurality 
of parallel execution .units for receiving respective in- 
structions from the supply circuitry, each instruction hav- 
ing a respective guard indicator selected from a set of 3S 
guard indicators, common to the plurality of execution 
units, one of said execution units including a master 
guard value store containing a master representation of 
current values for the guard indicators in said set of 
guard indicators and guard value transfer circuitry oper- *o 
able to transfer a guard value from said master.store to 
another of said execution units in response to a s end- 
guard instructbn being executed in said one execution 
unit, wherein the instruction supply circuitry comprises 
a main instruction queue for holding instructions to. be 
supplied to the parallel execution units and a subsidiary 
instruction queue for holding sendguard instructions, 
with Ihe subsidiary queue having priority access, to the 
execution pipelines to avoid unnecessary delays for ex- 
ecution of sendguard instructions. , r so 
[0006] According to afnother aspect of the present in- 
vention there is provided a method of executing instruc- 
tions in a computer system, said instructions having as- 
signed guard indicators, which method compris s sup- 
plying a plurality of instructions to parall I execution ss 
units, each instruction having a respective guard indica- 
tor selected from a set of guard indicators common to 
the plurality of execution units, holding a master set of 



current values for the guard indicators in one execution 
unit; effecting a transfer of the master guard value from 
said one. execution unit in response to a sendguard in- 
struction being executed in said one execution unit 
wherein instructions to be supplied to the parallel exe- 
cution unit, save for said sendguard instructions, are 
held in a main instruction queue and wherein said send- 
guard instructions are held in a subsidiary queue having 
priority access to the execution pipelines to avoid un- 
necessary delays for execution of sendguard instruc- 
tions. 

[0007] Thus there is a separate queue for sendguard 
instructions which has priority access to the execution 
pipeline. If this is done in the context of using guard val- 
ue, modifying instructions which can modify guard val- 
ues and which are held in the main instruction queue, it 
is important to check before each sendguard instruction 
is supplied to the execution pipeline that no earlier guard . 
value modifying instructions affecting the guard value 
requested by the sendguard instruction is still waiting in 
the main instruction queue. 

[0008] According to a further aspect of the invention 
there is provided a pipelined execution unit for a com- 
puter system which comprises at least two pipelined 
stages, with an earlier one of tho pipelined stages in- 
cluding sendguard circuitry responsive to a sendguard 
instruction to despatch a guard value for a guard indi- 
cator, defined in the sendguard instruction, and a later 
one of the pipelined stages including guard value mod- 
ifying circuitry for executing : guard value modifying in- 
structions which cause the value of a guard indicator to 
be modified, the pipelined execution unit further com- 
prising: a master guard value store for holding a master 
representation of current values for guard indicators; 
and means for determining any dependencies between 
a sendguard instruction in the earlier pipelined stage 
and a guard modifying instruction in the later pipelined 
stage; and switching circuitry for selecting when a send 
guard instruction is executed and responsive to any 
such dependencies, whether a guard value held in the 
master guard value store or a guard value just modified 
by the guard value : modifying circuitry is to be des- 
patched. 

[0009] According to a further aspect of the invention 
thece is provided a method of executing instructions in 
a pipelined execution unit, each instruction having a re- 
spective guard indicator selected from a set of guard in- 
dicators and execution of the instructions being predi- 
cated on the values of the guard indicators, wherein th 
instructions include a sendguard instruction which, 
when, executed, causes the transfer of a guard value 
from the pipelined execution unit and a guard value 
modifying instruction which -modifies tho valuo of a 
guard indicator, the method comprising: supplying in- 
structions' to the pipelined execution unit including said 
sendguard instructions and said guard valu modifying 
instructions; checking d pendencies between guard 
value modifying instructions supplied to the pipelined 
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execution unit earlier than a sendguard instruction fe- - 
lating to the same guard indicator; and supplying the * - 
guard value of the guard indicator requested in the send- v 
guard instruction selectively from a master guard value 
store or guard value modifying circuitry in dependence s 
on the results of said dependency checks, so as to en- 
sure that the guard. value of a guard indicator which is 
despatched responsive to a sendguard 'instruction is ■* 
correct in relation to any earlier guard value modifying 
instructions in the pipelined execution unit. • ^ ' to 

[001 0] According to the embodiment described here- ; 
in, each execution unit comprises a plurality of pipelined 
stages with synchronised pipeline cycles for each of the 
execution units. Circuitry is provided to allow for prompt " 
despatch of a guard value requested by a sendguard is 
nnicroinstruction, regardless of its dependency on earlier 
guard value modifying instructions in the pipeline, and 
without stalling the pipelines 

[0011] For a better understanding of the present in- 
vention and to show how the same may be carried into* 20 
effect, reference will now be made by way of example 
to the accompanying drawings in which:- " 1 { ir ] 

».• • f • "i 

Figure 1 shows a computer system in Accordance 
with one embodiment of tho invention;- ' : " v • % s 
Figure 2 shows the layout of the instruction queue^;^' 
Figure 3 is a diagram illustrating circuitry intfie pipe^- 
line to avoid stalling when executing sendguard in- 
struct ions; and - " . .vjiu oy- 
Figure 4 is a partial diagram of a microinstruction- 3& 



format. 




[0012] The computer system of this example is ar- 
ranged forthe parallel execution of a plurality of instruc- 
tions and is particularly suited to providing a high digital 35 
signal processing (DSP) performance. Instructions are 
held in a program memory 11 and after passing trough 
a control unit 1 2 are supplied to four parallel execution 
pipelines 13,14,15 and 16. Pipelines 13 and 14 are 
shown as slot 0 and slot 1 of a data unit arranged -to 40 
execute instructions carrying arithmetic operations. 
Pipelines 1 5 and 1 6 are shown as slot 0 and slot 1 of ah 
address unit 1 9 used to execute instructions for memory 
accesses to a data memory 20. Slot 1 or Slot 0 of the 
address unit 1 9 may also be used to supply instructions *s 
to a general unit 21 which shares some resources with 
the address unit 19. The general unit 21 includes a Con- 
trol register file 22 as well as branch circuitry which is 
used to provide instruction branch information on line 
23 to the control unit 12. so 
[0013] The two pipelines 1 3 and 14 in the data unit 1 8 
share a common data register file 26 and a common 
guard rogistor filo 27 holding tho master guard valuos 
which may be associat d with the instructions. Th two 
pipelin s also derive instructions from a common in- ss 
struction dispatch qu ue (D-IDQ) 29 in the data unit 18 
and instructions in the queue 29 are checked for data 
dependency by common vertical dependency check cir- 



cuitry 30 in the data unit 18. The sequence of operations 
in each of the pipeline stages in the data unit 1 8 include 
arvoperand fetch usually accessing one of the register 
files 26 or 27 followed by two execution stages which 
may use arithmetic circuitry 33 followed by a data write 
stage at which the result of an arithmetic operation is 
returned to the 1 register file 26 and 27; A similar pipeline 
exists for both pipelines 1 S and 1 4 on* the data unit 18. 
[0014] Similarly for the address unit 1 9, both pipelines 
15 and 16 access a common register file 40 holding 
pointer values for use in load or store operations in the 
data memory 20. The two pipelines each take their in- 
structions from a common instruction dispatch queue 
(A-IDQ) 41 and a similar vertical dependency check is 
provided in common for both pipelines 15 and 16 in the 
address unit 19. The vertical dependency check is sim- 
ilar to that'already referred to in the data unit 18. In ex- 
ecuting instructions through the two pipelines 1 5 and 1 6, 
accesses will be made to the register file 40. Add and 
subtract units 43 may be used in execution of the in- 
structions: Each of the pipelines 15 and 16 in the ad- 
dress unit 19 includes an operand fetch followed by an 
execution stage and ah address write back stage. 
[0015] Both the data unit 18 and the address unit 19 
are connected to tho data memory 20 through a data 
memory interface control 50. The control 50 is connect- 
ed by buses 53 to both the data unit 1 8 and the- address 
unit 19. The interface' control 50 includes a plurality of 
queues each connected to the buses 53. These queues 
include ' load data queues (LDQ) 60 for data awaiting 
loading into register files of the data unit 1 8 or address 
unit 1 9i The interface control 50 aiso includes store data 
queues (SDQ) 70 fortiSta awaiting storage in the mem- 
ory 20. Store address queues (SAQ) 71 are provided to 
indicate the locations in the memory 20 at which the data 
is to be stored. -» - 

[0016] It will be understood that when instructions are 
executed to loafd data from the data memory 20 into the 
data register files of the data unit 18, the address unit 
19 will access the data memory 20 and load the required 
data into the load data queues 60 prior to completing 
the update of the data register file 26 by transferring the 
data from the appropriate queue 60. Similarly when in- 
structions are' executed to store data from the dstta unit 
18 into the data memory 20 the appropriate data may 
be- held in the store data queues 70 together with the 
store addresses in queue 71 prior to completing the 
store operation in the dala memory 20. 
[0017] By ^executing the memory addressing instruc- 
tion in the address unit 19 in separate parallel pipelines 
from those provided in the data unit 18, the computer 
system operates access decoupling in that the memory 
accesses arc effected indepondontty of tho arithmetic 
operations carried out with ih the data unit 18. This re- 
duces the problem of m mory lat ncy. In a digital signal 
processing system which operates regular and repeat- 
ed operations the memory latency can be hidden from 
the executing program. * - ■ 
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[001 8] In iho above description, alt instructions which 
are fed ihrough pipelines 13, 14, 15 and 16 are subject 
to a vertical dependency check and if any data depend- 
ency is (ound which cannot be resolved by a bypass, 
the execution unit operates to cause a temporary stall 
in one of the pair of pipelines 13 or 14 or in the pair 15 
and 16 so as to cause a temporary delay in one of the 
pipelines of the pair so as to resolve the data depend- 
ency. It will be understood that a check for a data de- , 
pendency includes any form of data, including data rep- 
resenting guard values.' The circuitry described in the ... 
following, however, reduces the likelihood of stalling on 
guard dependencies. 

[0019] The control unit 12 is also arranged to provide 
a horizontal dependency check. In this specification a 
data dependency between instructions that are supplied 
to the parallel pipelines in the same machine cycle is 
referred to as a horizontal dependency. The control unit 
12 includes a. program counter and address generator 
80 to provide a memory address for an instruction fetch 
operation from the prog ram memory 11 . The machine 
may operate tn a selected one of a plurality of instruction 
modes including superscalar modes of variable instruc- 
tion bit length or in very long instruction word (VLIW). 
mode. The control unit 12 may include a process status 
register to indicate the instruction/node in which the ma- 
chine is operating. 

[0020] In use, a plurality of instructions are obtained 
from the memory 11 in a single fetch operation during 
one cycle and are decoded by a. decoder 82 in the con- 
trol unit 12. They are checked for horizontal data de- 
pendency by dependency checking circuitry 87 to indi- 
cate if a horizontal data dependency has been located. 
After decoding, the instructions are used to generate mi- 
croinstructions for each of the execution .pipelines. The 
instructions from the decoder 82 are passed to a micro- 
instruction generator 98 which generates a plurality of 
parallel microinstructions which areputput by a dispatch 
circuitry 99 through parallel paths 1 00 to the four parallel 
execution pipelines 1 3, 1 4, 1 5 and 1 6 and for the general 
unit 21 * If a horizontal dependency waslocated, the mi- 
croinstructions on lines 100 woub include an indication 
to the instruction despatch queues of the data unit 18 or 
address unit 19 that some action, , such .as a pipeline ~ 
stall, was necessary in the execution, pipelines to re- 
solve the horizontal dependency, 
[0021] In this example, each instruction may. be pro- 
vided with a guard indicator G between GO and G15. If 
a guard is true, then the instruction is executed (i.e. up- 
dates the architectural state of machine); if a guard is 
false the arch rtectural 'state of machine is not changed 
by instruction execution. Resolution of a guard may be 
dono in different pi'polinc stages of tho machine. 
[0022]. Th guard for each instruction may b selected 
between GO and G1 5 and in this particular xample the 
guard G1 5 is always true. The valu true or false attrib- 
uted to guards G0-G1 4 is however dependent upon the 
guard values held at any particular tirne in a guard reg- 



ister file. The master guard register file in this xample 
is guard register file 27 (sometimes referred to herein 
as th architectural guard register) in the data unit 18. 
However, a supplementary or shadow guard register file 

s (normally copies of the master file 27) is provided by a 
control, unit guard register file 101 in the control unit 12. 
The control unit 12 also includes a register 102 to indi-. 
cate which unit is currently known to be the guard owner 
in respect of each guard indicator. Register 102 has a 

10 first bit 1 03 for each guard indicator which if holding the 
value 1 indicates that the address unit 1 9 is currently the 
guard holder for that indicator. If bit 104 for each guard 
indicator is set to the value 1 then it indicates that the. 
control unit 12 is currently the guard owner forthat guard 

15 indicator.. If neither bit 103 nor 104 is set then the default 
condition indicates that the master guard register file 27 
must be used so that the data unit 1 8 is the guard owner. 
The guard values held in the guard register files can be 
changed by a guard modifiennstruction (GMI) executed 

20 by the data unit 18 or the address unit 1 9. The address 
unit also has a shadow guard register file 1 00 which may 
be updated by guard modifier instructions executed by 
the address unit 19. Instructions executed by the data 
unit will update the master file 27 and the shadow fil 

25. 101 in the control unit. Those executed by tho address 
unit will update the shadow file 100 and the master file 
27 (and hence the shadow file 101 in the control unit). 
[0023] , in, normal operation the guard register file 27 
in the data, unit 1 8 maintains the architectural state of 

30 the guard values GO to G14 and the register file is com- 
mon to both execution pipelines 13 and 14. The opera- 
tive values of the guards are the same for all execution 
pipelines although.as will be explained below, the differ- 
ent pipelines may access different register files to obtain 

as the guard values. 

[0024] In this example the master register file for the 
guard values is held in the data unit 18 as it is the data 
unit.that will most commonly execute instructions likely 
to change the value of the guards. Greater efficiency of 

*o execution cycles is therefore achieved by maintaining 
the master guard values in the register file which is di- 
rectly accessed by execution of the instructions in the 
data unit 18. When instructions are fed through either 
slot 0 or slot 1 of the data unit 18 the required guard 

4s value may be taken directly from the master guard reg- 
ister file 27 in accordance with the guard indicator that 
accompanied the microinstructions fed into the data unit 
18 from the control unit 12, unless the control unit 12 is 
the owner of the guard in which case the guard value 

so will have been taken ifrom the shadow registers 101 in 
the control unit 12. . 

[0025] In the case of instructions to the address unit 
19, the moro general position will bo the dofault condi- 
tion in th guard own r register 1 02 indicating that guard 
55 ownership does not belong to the address unit 1 9 or the 
control unit 12 and consequently th guard values re- 
quired for execution of the instructions in the address 
unit 1 9 will ne d to be obtained from the guard register 
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file 27 in the data unit 18. The microinstructions fed 5 
through lines 100 to the execution units will include sup- 1 
plying a "sendguard" (sndg) instruction to the data' unit 
1 B as the same time as supplying the appropriate mi- 
croinstruction to the correct slot of the address unit 1 9. s 
The "sendguard" instruction will be initially slotted into 
the instruction dispatch queue 29 of the data unit 18 in 
the same cycle of operations as the microinstruction re- 
quired for the address unit 19 is slotted into the instruc- 
tion dispatch queue 41 for the address unit. All micro- 10 
instructions in a given execution unit are always execut- 
ed in order and all guard manipulations and transfers 
are maintained in order with respect to these micro-in-' 
structions. This guarantees the synchfonicity of guard 
transfers (i.e. for every guard emission from a given ex- is 
ecution unit there is an opposite guard reception in an- 
other execution unit and all these iare done in order. The 
control unit has responsibility to generate the correermi- 
crb-instructions for guard transfers; the sending or re- 
ceiving execution unit only sees the send or receive (re- 20 * 
spectively) micro-instruction i.e. the action that it'must 
do). In this way the correct sequencing dccute J wjtH the 
correct guard value being obtained from the guard IVeg- ' 
ister file 27 corresponding to the instruction being exe- ; 
cutod in tho addross unit 19. Tho supply 6f trio *send- 2s 
guard" instruction in such a situation is illustrated at 1 10 
in the drawing. As described in the following, a bypass* " 
mechani'srn-fbr sendguard instructions is implemented • 
which nevertheless respects the requirement 'for syn- ! 
chrohicity of guard transfers. ' ' r " A :_ " 30 

[0026] The address : unit 19 has a queue 'of irisVuc- 
tions 41 awaiting dispatch tbthe execution units; ! lt also 
has a queue 111 (ARLQ) of items awaiting' loadirig into 
the pointer or control registers 40 or 22. there is also 
the queue 71 in the memory interface cdntrol SOdf 'store 35 
addresses queues awaiting a memory access as a re- : 
suit of partial execution of a store instruction irvthe ad- 
dress unit 19. When the address unit 19 awaits a {Juard 
transfer from the data unit 18, the instruction in the ad- 
dress unit 1 9 is stalled in the A-IDQ 41 or in the ARLQ 40 
111 or in the store address queue 71 until the requested 
guard value is transmitted from the data unit 1 8 through 
guard transfer circuitry 112 to the required destination. 
The transfer of the correct guard value will occur when 
the data unit 18 executes in its pipeline operation the 45 
"sendguard" instruction and the guard value which is 
transferred to either the address unit 19 or the data 
memory inlerface control 50 wilt need to be held in a 
queue ready for continued execution of the instruction 
once the stall is terminated. The transferred guard value so 
will be held in an A-IDQ guard queue 113 if the guarded 
instructibn was stalled in the IDQ 41 . If the stall was in 
tho ARLQ qucuo 111 thon the transforrod guard value 
will be held in an ARLQ guard queue 114. In th case 
of a store instruction where the store address had been 55 
added to the SAQ 71 in the interface control 50, the 
guard value will be transferred from circuitry 112 to ah 
SAQ guard queue 1 i 5 in the data memory interface con- 



trol 50 so that the memory access may be implemented 
in accordance with the entry in the SAQ 71 if the guard 
value transferred permits this. It will be seen that by this 
provision, the address unit can execute a memory store 
instruction as far as identifying the required store ad- 
dress arid adding that address to a queue in the inter- 
face 50 prior to checking whether or not the guard value 
of the store instruction is true or false. The store opera- 
tion will* be held in the queue 71 until the guard value is 
checked £hd will only proceed to completion if the guard 
value is true. In each case where the guard value is 
transferred to the address unit 19 ffdm the data unit 18, 
the stalled instruction for the address unit or general unit 
21 will be resumed or rendered inoperative depending 
on the guard value transferred from the data unit file 27. 
[0P27] ! The Use of the guard queues 113, 114and 115 
alloW resynchronisatidK" ol the guard values with the mi- 
croinstruction that caiised the request "sendguard" 110 
to be sent tolhe data unit 18. The above description for . 
operation of a guarded store instruction indicated'how 
the effective store address could be put oh the queue 
71 prior to resolving the guard value! The address unit 
1 9 may be operated with an earlier stall in the execution 
of a store instruction so that the effective address is not 
calculated and fed to tho interface control 50 until after 
the guard value has been transferred and resolved. Sim- 
ilarly a guarded load instruction may be executed by the 
address un ft 1 9 to access the memory and obtain the 
required data for addition to the load data queue 60 prior 
to resolving the guard value. Alternatively the address 
unit may cause an earlier stall awaiting resolution of the 
guard value transferred from the data unit prior to ob- 
taining the data from the memory and putting it into the 
queue 60. In the case where the data is obtained from 
trre memory and put onto the load data queue 60 prior 
to resolution of the guard value, the appropriate register 
file 26, 40 or 22 is "updated by a load operation from the 
load data queue 60 only if the guard value is found to ; 
be true. In the case of a false guard value, the register * 
files are not updated and the appropriate execution unit 
effects a read of the load data queue 60 to remove the 
unwanted data from the queue without updating any 
destination register file. 

[0028] As indicated above, the guard values to be as- 
signed to the guard indicators G0-G14 can be modified 
by executing a guard modifier instruction. These can be 
executed by either the data unit 18 or the address unit 
19. In the case of a guard modifier instruction executed 
by the data unit 1 8, the new guard value is calculated in 
the execution pipeline of the appropriate slot 0 or slot 1 , 
of the data unit 1 8 and, whichever ofthese pipelines has 
been used, it will cause a hew value of the guard to be 
writton into tho guard register filo 27 at tho end of tho 
execution pipeline in order to update the register file 27. 
Th guard modifi r instructions may themselv s b 
guarded. " ■"'*...*' 
[0029] In the case of the guard modifier instruction 
which is supplied to the address unit 1 9, the new value 
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will be calculated on execution of the instruction in either , 
slot 0 or slot 1 pipeline of the address unit 19 and the 
new value can be written into the address guard register 
file 1 00 of the address unit 1 9 at the end of the execution 
pipeline. However it is necessary for the new guard val- 
ue to be transferred to the master guard register file 27 
of the data unit 18 in order to update that master file at 
the correct cycle of operation. To achieve this, the con- 
trol unit 1 2 arranges to supply a "receive guard" instruc- 
tion to the data unit 18 simultaneously with the supply 
of any microinstruction to the address unit 1 9 which will" 
cause a new guard value to be calculated by the address, 
unit 19. This "receive guard" instruction is illustrated at 

116 in the drawing and is added" to the instruction dis- 
patch queue 29 of the data unit 18 in the same cycle as 
the microinstruction generating a new guard value is 
supplied to the address unit 19. As soon as the execu'r 
tion pipeline in the address unit 19 has calculated the 
new guard value it is supplied to a transfer FIFO 117 to 
enable supply of the new guard value to the data unit 

18. The "receive guard" instruction supplied to the data 
unit 1 8 is treated by the data unit 1 8 'in the same way as 
a guard modifier instruction except that it does not need 
to calculate the new guard value but merely obtains it . 
from the FIFO 117. 

[0030] The microinstruction generating the guard val- 
ue is supplied to the IDQ 41 of the address unit 1 9 in the 
same cycle as the "receive guard" microinstruction is fed 
to the IDQ 29 of the data unit 1 8. As explained above, 
synchronisation is achieved between the two units so. 
that the new guard value transferred through the FIFO 

117 updates the master register file 27 at the correct 
point in the sequence of the instruction execution. , 
[0031] It is possible to execute a load GR instruction 
in order to load from memory 20 into the guard register 
files 27 and 100 of the data unit 18 in the address unit 

1 9. This instruction is treated as a load data register and 
a load pointer register instruction and the correct value 
is obtained from memory and put into the load data 
queues 60. The transfer from the load data queues 60 
to the address unit 1 9 and data unit 18 occurs independ- 
ently and the data is maintained in the load data queue 
60 until both units have read the appropriate data. The. 
microinstructions in the instruction dispatch queues 29 
and 41 necessary to load the guard registers 27 and 1 00 
are stalled until the execution units of both the data unit 
18 and the address unit 19 is empty of any guard mod- 
ifier instructions. At trial lime the load guard register in- 
structions can proceed to transfer the required value 
from the load data queue 60 into the guard register file 
without conflict with any other guard modifier instruc- 
tions. 

[0032] It is apparont that when tho data unit 18 re- 
ceives a sendguard microinstruction, th corr ct guard 
value should b sel cted and passed to the requ sting 
xecution unit as soon as possible to avoid unnecessary 
delays A mechanism is described in the following which 
achieves this without unnecessary stalls in the data unit 



18. . 

[0033] The instruction data queue 29 of the data unit 
is implemented as a FIFO having a plurality of storage 
locations, each of which, holds two microinstructions. 

s (slot 0 and slot 1 ). Reference is made to Figure 2 which 
illustrates the main instruction FIFO 29 as having four 
storage locations LO to L3, although in fact more than 
four would normally be provided. To implement the 
sendguard mechanism, a separate sendguard queue 

10 implemented as a FIFO 200 is provided for holding the 
sendguard (sndg) fields of any sendguard microinstruc-. 
tions written to the IDQ 29, each sndg field being asso-' 
ciated with a sendguard tag sndg_tag. A write event 
counter 202 is incremented by 2 each time two new mi- 

*5 constructions are written to the main instructional FO 
29. It includes an overflow bit wr^ovfl to take into ac- 
count circular pointer effects. A read-event counter 204 
counts the number of microinstructions read into the da- 
ta unit pipelines 13.14. This may be two in each cycle 

20 or, where a horizontal slot dependency has been re- 
solved, only one (i.e. a split sequence). 
[0034] A gmLtag register 206 .holds a gmijag for 
each guard indicator. As already mentioned, there are 
fifteen guard indicators GO to G1 4 with assigned guard 

25 : values although only three such tags are shown in Fig- 
ure 2 labelled guard0_tag,guard1_tag etc. Each tag has 
six bits., Four. bits" represent a write. count which is a value 
read from the write counter 202 (plus 0 for slot 0, plus 1 
for slot 1) whenever a guard modifier instruction affect-. 
. 30 ing that guard indicator is written to the FIFO 29.. One 
bit is an aux_gmi_tag bit which is the overflow bit wr_ovfl 
from the write counter 202. the remaining bit is an active 
gmi bit act_gmi which indicates whether there is a valid 
guard modifier instruction (GMI) stored in the main in- 

35 structipn FIFO 29 affecting that guard indicator. This bit 
is set tp one. if a new GMI affecting that guard indicator 
is written to the main instruction FIFO 29 where there 
was not one before or if there remain any active GMis 
affecting that guard* indicator in the main .instruction 

40 FIFO 29. 

[0035] Each time a sndg instruction is written to the 
sndg queue 200, the gmLtag value for the guard indi- 
cator referred to in the sndg instruction is written to the 
sndgjag associated with the sndg field is the sndg 

45 queue 200. Thus this indicates the symbolic slot position 
of the youngest GMI in the main instruction FIFO 29 
modifying the requested guard indicator. . . 
[0036] The purpose of the separate sndg queue is to 
proyide a bypass" feature into the data unit pipelines 

50 1 3, 1 4 so that as soon as a sndg instruction arrives it can 
be supplied to one of the pipelines 13,14 as soon as 
possible without unnecessary delays. However, it is im- 
portant that it is not supplied to a pipeline jn a cycle oar v 
lier than a GMI aff cting that guard indicator which was 

55 xpect. dtobe x cuted earlier according to the instruc- 
tion, sequenc in the original program. To make sur that' 
this does not happen, a bypass controller 208 compares 
the. read event count with the sndg tag at the head of 
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the sndg queue 200 and the sndg instruction is supplied. . 
to the pipeline only if read-event counter :> sndg-tag. 
Otherwise the next instruction in the FIFO 29 is fetched 
into the execution pipeline 1 3 or 1 4. . 
[0037] A mechanism will now be described with refer- 
ence to Figure 3 which avoids stalls, of sendguafd in- 
structions which enter the data unit pipeline 13 or 14. In 
order to achieve this, it is necessary to' know the depend- 
ency between a sendguard instruction despatched into 
the data unit pipeline 13 or 14 and any guard' modifier 
instructions affecting the value oMhe guard to be sent. * 
Any such dependencies are resolved by the vertical de- 
pendency checker 30 for the data unit pipeline's. .Signals 
from the vertical dependency checker are despatched 
into the data tinit pipelines to control multiplexors de- 
scribed hereinafter to avoid stalling regardless of guard 
dependencies. ^ c l \ 

[0038] The vertical dependency checker 30 uses in- " 
formation provided in the fields of the r^icrdinstructidns 
supplied to the data unil Figure 4 illustrates the relevant' 
fields of a microinstruction. It'wifi be appreciated tha| the 
microinstruction contains a" number of otfjerfietds, but 
these are not included herein because they (do hot affect 
the dependency resolution. For guard modifieMnstruc; 
tiohs, the rolovant fields arc dg-v which is equal to pine"" 
when a guard is to be modified/ anfd dg-guard [' which 
comprises four bits identifying which" of the ' sixfeeh ; 
guard' 'indicators is to be modified. ' : ' " ' 1 J ' ^ \ 
[0039] the fields relevant to sendguard' instructions " 
are sndg-v which is ecjual to one' when a guafcJ is to be" 
sent and shdg-guard which identifies;^hicrt ^^guard tHdj- 
catorvalue is to be sent. ' ' ' . ' . 
[0040] ' As has briefly been 'explained above, eadri of 
the data unit pipelines comprises' a data qperadd fetdh 
stage 210, a first execution' stage EXV 21 2, a sec6nd' 
execution stage EX2 214 and a data writeback ^tage ' 
DWB 216. instructions and data are propagated through 
the pipeline oh successive machine cycles, such that 
each pipeline stage represents operations in each cycle,. 
Figure 3 does not illustrate the components of the pipe- 
line which are necessary for the normal execution of in- 
structions, nor does it show the interaction of the pipe- ' 
line with the data register file 26. The required fields of 
the microinstructions are used at each appropriate 
stage in the pipeline and in particular the GMI and send- 
guard fields. The GMI fields dg-v and dg-guard are re- 
ceived in the DOF stage and are propagated through 
the subsequent first and second execution stages EX1 
and EX2 as the instruction is executed. As a GMI in- 
struction might itself be guarded, if it Occurs thaMhe 
guard indicator on which the GMI instruction is guarded 
is false, the field dg-v is unset and continues to propa- 
gate in the pipo with tho unsot value of zoro. The de- 
pend ncies between the sendguard instructions to be 
ex cuted and any GMI instructions currently in the pipe 
can be r solved by using simple comparators with re- 
spect to the GMI fields and the sendguard fields. For 
example for a sendguard instruction in the DOF stage 
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of the pipeline, a comparison is made between its cor- 
responding sendguard fields sndg-v and sndg-guard 
and the information available in subsequent stages of 
the pipeline EX1 , EX2 relating to a possible GMI instruc- 
tion, ahead in the pipeline. For example, if a guard mod- 
ifier instruction affecting the guard indicator GO is in the 
first execution stage EX1 , and the sendguard instruction 
referring to. GO isjri the dafe operand fetch stage, a spe- 
cific; signal is, set indicating thatsuch a dependency ex- 
ists. These signals are set by the vertical dependency 
checking unit 30 and are used tp.control multiplexors in 
the' pipeline as described later. ^ 
[0041] Circuitry for executing shdg instructions is 
shown in dotted outline referenced 21.9 in DOF stage 
210 and 221 in E>C1 i stage 212. These sendguard exe- 
cutiori units 219,212 receive sndg opcode from the 
sendguard instruction. "Guard indicators for addressing 
the guard register file 27 are alsjo supplied with the send- 
, guard instruction. It will be appreciated that the pipeline 
includes other units, omitted for the sake pf clarity. Fig- 
ure 3 is merely a diagrammatic representation of the 
pip-eline stages, and additionally illustrates circuitry pro- 
viding a mechanism for avoiding stalls of sendguard in- 
f . struct ions! The circuitry comprises a first selection mul- 
tiplexor 21 8 located in tho data operand fetch stage 210, 
a second selection multiplexor 2£0 located in the 'first 
execution stage 212 and a guard selection multiplexor 
222 which [.can propagate the correct guard value in ei- 
. the? the data operand fetch stage or the first execution 
stage as described in the v f bib wing. A computation unit 
224 allows guard values to be computed responsive to 
guard modifier instructions in the second execution 
stage EX2 214. The output of t!he computation unit 224 
is supplied to the guard ? register file 27 via a guard value 
" output multiplexor 22& The guard register file 27 is ad- 
dressable via the guard indicators in a manner not 
shoWn in Figure 3".such that the computed' guard value 
is written against the'coYrect guard indicator, fh each cy- 
. cle, for each guard indicator either the existing value is 
written back into the guard register file via the feedback 
loop 228, or the newly modified guard value is written to 
the guard register file for the particular guard indicator. 
The mechanisriri operates as follows! 
[0042] When there is no guard dependency, that is 
when there are no earlier instructions in 1( the pipeline 
which imply the modification of .the guard value which 
has been requested to be sent by tfie sendguard instruc- 
tion which has just entered the pipeline, the first selec- 
tion multiplexor 218 is controlled to supply the architec- 
tural guard value from the guard register file 27 and the 
guard selection multipiexor'222 outputs this guard value 
in the data operand fetch stage of the pipeline. Thus, 
only a single cycle is.roquircdto deal with the sondguard 
instruction and to despatch the correct guard valu . 
[0043] When there is an earlier instruction in th pipe- 
lin Ibcat d two cycles ahead which modifies th guard 
.value requested to be sent by the sendguard instruction, 
this guard value is computed in the second execution 
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stage of th pipe I in as th sendguard instruction is r - 
ceived in the data operand fetch stage. The just com- 
puted guard value is supplied to the first selection mul- 
tiplexor 21 8 and is selected through to the guard selec-, 
tioh multiplexor 222. Thus, this also' allows the correct 
guard va|ue to be despatched in the first cycle of exe- 
cution of the sendguard instruction. 
[0044] The mechanism is slightly more complicated 
in the case where a guard modifying instruction is locat- 
ed one cycle ahead of the sendguard instruction and is 
modifying the requested guard value. In this case, the 
sendguard instruction is not dealt with in the data oper- 
and fetch stage, but is propagated though to the first 
execution stage EX1. In the same machine cycle, the 
guard modifying instruction ahead of it is computed by 
the computation unit 224 in the second execution stage 
EX2 and the just computed value is returned to the sec- 
ond selection multiplexor 220. In this case therefore, the 
guard selection multiplexor 222 despatches the output 
of the second selection multiplexor 220 rather than the 
output of the first selection multiplexor 218 thereby al- 
lowing the sendguard instruction to be dealt with in two 
cycles. Although this is slower than a single cycle 
achieved in the first two cases, the performance is still 
high because no stalls in the pipoli.no have occurred. 
[0045] By including a guard value store 230 atthe out- 
put of the data operand fetch stage, it is also possible 
to deal with two sendguard instructions entering the 
pipeline one after the other immediately following a 
guard modifier instruction. The first sendguard instruc- 
tion which has the guard modifier instruction just in front 
of it by one cycle is dealt with as just described. That is, 
it is propagated through to the first execution stage EX1 
so that the guard value can be computed in the compu- 
tation unit 224 in the second execution stage EX2. The 
just computed guard value is then despatched as al- 
ready described. While this is happening it is however 
possible to deal with the second sendguard instruction 
in the data operand fetch stage. If the sendguard instruc- 
tion relates to a guard value not affected by the guard 
modifier instruction, it can be dealt with in the normal 
way and the value selected from the architectural guard 
register 27 held in the store 230 ready for despatch on 
the subsequent cycle after the just computed guard val- 
ue has been despatched. If the second sendguard in- 
struction relates to the same guard indicator which has 
been modified by the earlier guard modifier instruction, 
the value which is held in the store 230 is the just com- 
puted value which is returned back to the first selection 
multiplexor 218 in the data operand fetch stage. This 
allows maximum use of the pipeline stages and avoids 
stalls of the sendguard instructions whatever are the de- 
pendencies. 

Claims 

1. , A computer system for executing instructions hav- 



ing assign d guard indicators, which system com- 
, "prises instruction supply circuitry, a plurality of par- 
allel execution units for receiving respective instruc- 
tions from the supply circuitry, each instruction hav- 
5 ing a respective guard indicator selected from a set 
of guard indicators, common to the plurality of exe- 
cution units, one of said execution units including a 
master guard value store containing a master rep- 
resentation of current values for the guard indica- 
te tors in said set of guard indicators and guard value 
transfer circuitry operable to transfer a guard vajue 
from said master store to another of said execution 
units in response to a sendguard instruction being 
executed in said one execution unit, wherein the in- 
* 5 struction supply circuitry comprises a main instruc- 
tion queue for holding instructions to be supplied to 
. the parallel execution units and a subsidiary instruc- 
tion queue for holding sendguard instructions, with 
the subsidiary queue having priority access to the 
20 execution pipelines to avoid unnecessary delays for 
execution ,of sendguard instructions. 

2. A. computer system according to claim 1 in which 
said execution unit includes circuitry for executing 
2S guard value modifying instructions which aro hold 
in the main instruction queue and for updating said 
, master guard value store with any modified guard 
■. value. ... , 

30 3. A computer system according to claim 2 in which 
the instruction supply circuitry comprises a control 
unit common to each parallel execution unit, said 
control unit including queue checking circuitry 
which is operable to.check, before each sendguard 

35 . instruction is supplied to the execution pipeline, that 
. no earlier guard value modifying instructions affect- 
ing the guard value requested by the sendguard in- 
struction is still waiting in the. main instruction 
queue. 

40 

4. A computer system accordin g to any one of the pre- 
ceding claims in which each execution unit compris- 
es a plurality of pipelined stages with synchronised 
pipelined cycles for each of the execution units. 

45 

5. A computer system according to claims 2 and 4, 
wherein the circuitry for executing guard value mod- 
ifying instructions is located in a pipelined stag 
downstream of an earlier pipelined stage operable 

50 r to execute sendguard instructions. 

6. A computer system according to claim 5, compris- 
ing switching circuitry in the earlior pipelined stage 
for supplying a guard valu which has been modi- 

ss . fied by a guard value modifying instruction in the 
subsequent pipelined stage in a machine cycle re- 
sponsive, to a sendguard instruction in the earlier 
. pipelined stage in the same machine cycle. 
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7. " A computer system according to claim i, in Which 

each execution unit comprises first, second and 
third pipelined stages wherein the first and second 
pipelined stages 1 contain circuitry for executing 
sendguard instructions, and the third pipelined s 
stage includes circuitry for executing guard value 
modifying instructions. 

8. A computer system according to claim 7, Which 
comprises switching circuitry for selecting, respon- 10 
sive to a sendguard instruction, whether a guard 
value held in the master "guard value store or a 
guarc* value just modified by a guard value modify- 
ing instruction is despatched via the guard value 
transfer circuitry. j: is 

9. A computer system according to ! any one' of the pre- 
ceding claims in which said execution unit Incfude 
execution pipelines providing access' \6 a data 
memory, said pipelines including a first set of pipe- 20 
lines for use in executing instructions heeded for 
memory access operations and a second set of 
pipelines arranged to carry 6ut : arithmetic opera- 
tions, thereby providing decoupling of rneVhory ac- 
cess operations from arithmetic operations: * ; 25 

10. A computer system' according'Yd claim ^9 in 'Which 
two parallel pipelines are provided for arithmetic op- 
erations and access a common set of data registers 
including said master guarcJ value ; store* : " : " " 30 

11. A computer system'accbrding to any one of trie pre- 
ceding claims in which raid h /h^er^u£n^;vaiue 
store comprises a register file. ' ; ; ' 

12. A method of executing instructions in a' computer 
system, said instructions having assigned guard in- 
dicators, which method comprises supplying a plu- 
rality of instructions to parallel execution units, each 
instruction having a respective guard indicator se- *o 
lected from a set of guard indicators common to the 
plurality of execution units, holding a master set of 
current values for the guard indicators in one exe- 
cution unit, effecting a transfer of the master guard 
value from said one execution unit in response to a 
sendguard instruction being executed in said one 
execution unit wherein instructions to be supplied 

Lo the parallel execution unil, save 1 for said send- 
guard instructions, are held in a main instruction 
queue and wherein said sendguard instructions are so 
held in a subsidiary queue having priority access to 
the execution pipelines to avoid unnecessary de- 
lays for execution of sbndguard instructions. 

13. A method according to claim 12, including xecut- ss 
ing instructions to modify said guard values wher in 
execution of ah instruction to modify a guard value 
updates said master value, said guard value modi- 



: fying instructions being held in the main instruction 
" queue. 

14. A method according to cjaim 13, which comprises 
the step of checking, before each sendguard in- 
struction is supplied to the execution pipeline, that 
no earlier guard value modifying instruction affect- 

* ing the guard value requested by the sendguard in- " 
structidn is still waiting in the main instruction 
; queue. ' . \ 

15. A pipelined execution unit' for a computer system 
which comprises at ieast two pipelined stages, with 
an earlier one of the pipelined stages including 
sendguard circuitry responsive to a sendguard in- 
struction to despatch a guard value for a guard in- 
dicator defined In the sendguard instruction, arid a 
later one of the pipelined stages including guard val- 

'" ue modifying, circuitry for executing guard value 
modifying instructions' which cause the value of a 

" guard indicaior to be "modified, the pipelined execu- 
tion unit further comprising: 

a master guard value store for holding a master 
Vcprospntation of current values 'ff of guard indi- 
"' cators; and 
means for determining any dependencies be- 
ri " tween a seridgfuard instruction in the earlier 
^ pipelined stage and a guard modifying instruc- 
tion in the later pipelined stage; and 
r switching circuitry for selecting when a send 
guard instruction is executed and responsive to 
any such dependencies, whether a guard value 
held in the master guard value store or a guard 
- : value just rriWifled by the guard value modify- 
ing circuitry is to be despatched. 

16. A pipelined execution unit according to claim 15, 
which comprises first, second and third pipelined 
stages wherein the first and second pipelined stag- 
es each contain such "circuitry for executing send- 

! guard instructions, and the third pipelined stage in- 
cludes such circuitry for executing guard value 
' modifying instructions. 

17. A pipelined execution unit according to claim 15 or 
16, which includes a data write-back stage for writ- 
ing back the results of execution of an instruction 
into a data store. 

18. A method of executing instructions in a pipelined ex : 
ecution unit, each instruction having a respective 
guard indicator soloctod from a sot of guard indica- 
tors and ex cut ion of the instructions being predi- 
cated on the values of the guard indicators, wherein 
the instructions include a sendguard instruction 
which, when executed, causes the transfer of a 
guard value from thepipelined execution unit and a 
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guard value modifying instruction which rnodifl s 
the value of a guard indicator, the method compris- 
ing: 

supplying instructions to the pipelined execu- s 
tion unit including said sendguard instructions 
and said guard value modifying instructions; 
checking dependencies between guard value 
modifying instructions supplied to the pipelined 
execution unit earlier than a sendguard instruc- 10 
tion relating to the same guard indicator, and 
supplying the guard value of the guard indicator 
requested in the sendguard instruction selec- 
tively from a master guard value store or guard 
value modifying circuitry jn dependence on the is 
results of said dependency checks, so as to en- 
sure that the guard value of a guard indicator 
which is despatched responsive to a sendguard 
instruction is correct in relation to any earlier 
guard value modifying instructions in the pipe- 20 
lined execution unit. 
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